Abstract

Finding relationships among different indices such as the h-index, g-index,
e-index, and generalized impact factor is a challenging task. In this paper,
we describe some bounds and inequalities relating the h-index, g-index,
e-index, and generalized impact factor. We derive these bounds and
inequalities from the basic definitions of the indexing parameters, without
assuming that any of them follows a continuous model.
1 Introduction

A great deal of research is carried out by people working in different areas.
Sometimes, one needs to evaluate the quality of the research produced by
individual authors or groups of authors. This quality is generally evaluated
in terms of a ranking parameter, which is usually based on the number of
citations received by the authors' papers. Many ranking parameters have been
presented in the literature for evaluating the quality of research, such as
the h-index [12], g-index [S], e-index [13], and impact factor. These
parameters can also be used to evaluate the quality of research published by
a journal. In the long term, the impact factor becomes the average number of
citations per published paper. This long-term impact factor is termed the
generalized impact factor in [T].

Having computed one index for evaluating the quality of research, one would
like to get an indication of the other types of indices. To have such an
indication, one needs to know how an index is related to the other indices.
The relationships among the h-index, g-index, and e-index are described in
[15]. However, in [15], the indices are assumed to follow a continuous
distribution. A relation between the h-index and the impact factor is
described in [5] using a power law model called Lotka's model. An attempt to
relate the h-index, g-index, and the generalized impact factor is made in
[T]. The relationships therein contain only inequalities among parts of the
number of citations. Because those parts are not bounded by any expression,
the relationships described therein seem to be trivial. However, it was an
attempt to derive the relationships among the indices from their basic
definitions, and no distribution or continuous model was assumed for any of
the indices.

In this paper, we describe bounds for the h-index and g-index in terms of
the other indices and the generalized impact factor. We derive these bounds
from the very basic definitions of the indices and the generalized impact
factor, without assuming that any of these indices follows a model or a
continuous distribution.

In what follows, we provide a brief overview of the indices and the generalized 
impact factor. 

2 Overview of Indices and Impact Factor 

In this section, we present an overview of impact factor and different types of 
indices. 



2.1 The h-Index

Suppose the papers are arranged in descending order of the number of
citations. Let $c_i$ be the number of citations of the paper numbered $i$.
The h-index [12] can then be defined as follows.

$$h = \max(i) : c_i \ge i. \tag{1}$$

By definition, the h-index is the largest number $h$ such that each of the
first $h$ papers, arranged in decreasing order of citations, has at least
$h$ citations.

2.2 The g-Index

According to the definition of the g-index, if the papers are arranged in
descending order of their number of citations, $g$ is the largest number
such that the top $g$ papers together have at least $g^2$ citations. In
other words, when papers are arranged in descending order of their
citations, the g-index can be defined as follows.

$$g = \max(i) : \sum_{j=1}^{i} c_j \ge i^2. \tag{2}$$

Note that the g-index is the largest number $i$ such that
$\sum_{j \le i} c_j \ge i^2$.
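As an illustration of definitions (1) and (2), the following minimal Python
sketch computes both indices from a list of citation counts. The function
names `h_index` and `g_index` and the sample list are assumptions made for
this example only, and the sketch assumes that the g-index cannot exceed the
number of papers.

```python
def h_index(citations):
    """Return the largest h such that the h-th most cited paper has >= h citations."""
    ranked = sorted(citations, reverse=True)  # descending order, as in the text
    h = 0
    for i, c in enumerate(ranked, start=1):
        if c >= i:
            h = i
        else:
            break  # c_i < i also holds for every later paper
    return h


def g_index(citations):
    """Return the largest g such that the top g papers total >= g^2 citations."""
    ranked = sorted(citations, reverse=True)
    g, running_total = 0, 0
    for i, c in enumerate(ranked, start=1):
        running_total += c
        if running_total >= i * i:
            g = i
    return g


sample = [10, 8, 5, 4, 3, 2, 1]  # hypothetical citation counts
print(h_index(sample), g_index(sample))  # prints: 4 5
```

For the sample list, the fourth paper has 4 citations but the fifth has only
3, so h = 4; the top five papers total 30 citations (at least 25), while the
top six total 32 (less than 36), so g = 5.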



2.3 The e-Index 



The e-index is defined in [14] to serve as a complement to the h-index. The
definition of the e-index is as follows.

$$e^2 = \sum_{i=1}^{h} c_i - h^2. \tag{3}$$

Alternatively, (3) can be written as follows.

$$\sum_{i=1}^{h} c_i = h^2 + e^2. \tag{4}$$

Remark: In the definitions of the h-index (as given by (1)) and the g-index
(as given by (2)), we have intentionally ignored the time T at which we are
considering their values. This keeps the definitions simple, and there is no
loss of generality as far as the discussion in this work is concerned. For
precise definitions of the indices incorporating the time, the reader is
referred to [3]. The same is true for the e-index.
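The relation between the h-index and the e-index in (3) and (4) can be
sketched in a few lines of Python. The helper name `h_and_e_squared` and the
sample list are hypothetical; the sketch returns $e^2$ rather than $e$,
because $e^2$ is always an integer while $e$ generally is not.

```python
import math


def h_and_e_squared(citations):
    """Return (h, e^2), where e^2 = (citations of the top h papers) - h^2."""
    ranked = sorted(citations, reverse=True)
    # With papers in descending order, c_i >= i holds exactly for i = 1..h.
    h = sum(1 for i, c in enumerate(ranked, start=1) if c >= i)
    top_h_citations = sum(ranked[:h])
    return h, top_h_citations - h * h


sample = [10, 8, 5, 4, 3, 2, 1]  # hypothetical citation counts
h, e2 = h_and_e_squared(sample)
print(h, e2, round(math.sqrt(e2), 3))
```

Here the top four papers have 27 citations in total, so $e^2 = 27 - 16 = 11$,
which is consistent with (4).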



2.4 Generalized Impact Factor 

Let $c_i$ be the number of citations of the $i$th paper published in a
journal, and let $P$ be the total number of papers published by the journal.
The generalized impact factor [Tpl, which would better be called the average
number of citations per paper [1], or an impact factor without a time window
constraint [T], or an impact factor with a time frame from the beginning of
the publication of the journal until now or until the time of evaluation
[T], or simply an impact factor [5], is defined as follows.

$$I_f = \frac{\sum_{i=1}^{P} c_i}{P} = \frac{C}{P}, \tag{5}$$

where $C$ is the total number of citations.
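As a quick worked instance of (5), added here for illustration, take the
values listed for author A in Table 1 (70 citations over 17 papers):

```latex
I_f = \frac{C}{P} = \frac{70}{17} \approx 4.12
```

This agrees with the value of the generalized impact factor reported for
author A in that table.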



3 Analysis of Relationships 

In this section, we describe how the indices and the generalized impact
factor are related to one another.

Footnote: A better name for the generalized impact factor is the average
number of citations per paper, or the long-term impact factor.



3.1 Impact Factor, h-Index, and e-Index

We state the following theorem that relates these parameters.

Theorem 1. Let $P$ be the number of papers published in a journal. The
h-index, e-index, and impact factor are related by the following inequality.

$$h \ge \left\lfloor I_f - \frac{e^2}{P} \right\rfloor. \tag{6}$$



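Proof. Using (5), the total number of citations of a journal can be written
as follows.

$$\sum_{i=1}^{P} c_i = I_f P. \tag{7}$$

The citations appearing in the L.H.S. of (7) can be broken into two parts,
one from $1$ to $h$ and the other from $h + 1$ to $P$, as given below.

$$\sum_{i=1}^{h} c_i + \sum_{i=h+1}^{P} c_i = I_f P. \tag{8}$$

Using (4) and (8), we have,

$$h^2 + e^2 + \sum_{i=h+1}^{P} c_i = I_f P. \tag{9}$$

Now, since the papers are in descending order of citations and $h$ is the
largest index with $c_i \ge i$, we have,

$$c_{h+1} \le h, \quad c_{h+2} \le h, \quad \ldots, \quad c_P \le h. \tag{10}$$

Therefore, we have,

$$\sum_{i=h+1}^{P} c_i \le (P - h)h. \tag{11}$$

Using (11) and (9), we have,

$$h^2 + e^2 + (P - h)h \ge I_f P, \tag{12}$$

$$e^2 + Ph \ge I_f P. \tag{13}$$

In other words, we have,

$$h \ge I_f - \frac{e^2}{P}.$$

Since $h$ is a whole number, we can write,

$$h \ge \left\lfloor I_f - \frac{e^2}{P} \right\rfloor.$$

□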



In other words, we can say that

$$h = \Omega\left(I_f - \frac{e^2}{P}\right), \tag{14}$$

where $\Omega$ denotes the lower bound.
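The bound of Theorem 1 can be checked numerically on any citation list. The
sketch below is an illustration only: the function name
`theorem1_quantities` and the sample data are assumptions, and the list is
assumed to be non-empty so that $P > 0$.

```python
def theorem1_quantities(citations):
    """Return (h, bound) with bound = I_f - e^2 / P for a non-empty citation list."""
    ranked = sorted(citations, reverse=True)
    P = len(ranked)
    # With papers in descending order, c_i >= i holds exactly for i = 1..h.
    h = sum(1 for i, c in enumerate(ranked, start=1) if c >= i)
    e2 = sum(ranked[:h]) - h * h       # equation (3)
    impact_factor = sum(ranked) / P    # equation (5)
    return h, impact_factor - e2 / P


sample = [10, 8, 5, 4, 3, 2, 1]  # hypothetical citation counts
h, bound = theorem1_quantities(sample)
print(h, round(bound, 3))
assert h >= bound  # the inequality stated just before the floor is taken
```

For the sample list, $I_f = 33/7$ and $e^2/P = 11/7$, so the bound evaluates
to $22/7 \approx 3.14$, which is below $h = 4$.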



3.2 The g-Index, h-Index, and e-Index

We state the following theorem that provides an inequality relating these
indices.

Theorem 2. The h-index, g-index, and e-index are related by the following
inequality.

$$h \ge \left\lfloor g - \frac{e^2}{g} \right\rfloor. \tag{15}$$



Proof. Let the papers be arranged in descending order of their citations.
From the definition of the g-index, as given in (2), we have,

$$g = \max(i) : \sum_{j=1}^{i} c_j \ge i^2. \tag{16}$$

At $i = g$, we have,

$$\sum_{i=1}^{g} c_i \ge g^2. \tag{17}$$

Breaking the number of citations in the L.H.S. of (17) into parts, we have,

$$\sum_{i=1}^{h} c_i + \sum_{i=h+1}^{g} c_i \ge g^2. \tag{18}$$

From (4) and (18), we have,

$$h^2 + e^2 + \sum_{i=h+1}^{g} c_i \ge g^2. \tag{19}$$

In other words,

$$g^2 - (h^2 + e^2) \le \sum_{i=h+1}^{g} c_i. \tag{20}$$

Now, we have,

$$c_{h+1} \le h, \quad c_{h+2} \le h, \quad \ldots, \quad c_g \le h. \tag{21}$$

Therefore, we have,

$$\sum_{i=h+1}^{g} c_i \le (g - h)h. \tag{22}$$

Using (20) and (22), we have,

$$g^2 - (h^2 + e^2) \le (g - h)h. \tag{23}$$

Or,

$$g^2 - e^2 \le gh. \tag{24}$$

Rearranging, we have,

$$h \ge g - \frac{e^2}{g}. \tag{25}$$

Since $h$ is a whole number, (25) can be written as follows.

$$h \ge \left\lfloor g - \frac{e^2}{g} \right\rfloor.$$

□



In other words, Theorem 2 provides a lower bound for the h-index in terms of
the g-index and the e-index.

$$h = \Omega\left(g - \frac{e^2}{g}\right). \tag{26}$$
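As an added worked instance of this bound, using the values listed for
author A in Table 1 ($h = 4$, $g = 7$, $e^2 = 23$):

```latex
g - \frac{e^2}{g} = 7 - \frac{23}{7} = \frac{26}{7} \approx 3.71,
\qquad
\left\lfloor \frac{26}{7} \right\rfloor = 3 \le h = 4 .
```

The value 3 matches the entry for author A in the fourth column of Table 4.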



We have the following lemma that provides a bound for the g-index.

Lemma 1. An upper bound for the g-index is as follows.

$$g = O\left(h + \frac{e^2}{h}\right). \tag{27}$$



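Proof. From (20), we have

$$g^2 - (h^2 + e^2) \le \sum_{i=h+1}^{g} c_i.$$

In (21), if we put $g$ at the R.H.S. for $h + 1 \le i \le g$ (which is
permissible because $h \le g$), we get,

$$\sum_{i=h+1}^{g} c_i \le (g - h)g. \tag{28}$$

Therefore, from (20) and (28), we have,

$$g^2 - (h^2 + e^2) \le (g - h)g. \tag{29}$$

Or,

$$h^2 + e^2 - gh \ge 0. \tag{30}$$

Or,

$$h^2 + e^2 \ge gh. \tag{31}$$

This gives us,

$$g \le h + \frac{e^2}{h}. \tag{32}$$

Again, since $g$ is a whole number, we can write,

$$g \le \left\lceil h + \frac{e^2}{h} \right\rceil. \tag{33}$$

Alternatively,

$$g = O\left(h + \frac{e^2}{h}\right).$$

□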



We now prove another theorem that provides an upper bound for the g-index
in terms of the h-index and e-index.

Theorem 3. An upper bound for the g-index in terms of the h-index and
e-index is as follows.

$$g = O(h + e). \tag{34}$$

Proof. Using (23), we have,

$$g^2 - gh - e^2 \le 0. \tag{35}$$

This resembles the quadratic equation $ax^2 + bx + c = 0$, whose roots are
as follows.

$$x = \frac{-b \pm \sqrt{b^2 - 4ac}}{2a}.$$

Here, we have $a = 1$, $b = -h$, $c = -e^2$; therefore, taking the positive
root, the bound on the g-index is,

$$g \le \frac{h + \sqrt{h^2 + 4e^2}}{2}. \tag{36}$$

Now, we know that $(h + 2e)^2 = h^2 + 4e^2 + 4eh$. In other words, we have,

$$h^2 + 4e^2 \le (h + 2e)^2. \tag{37}$$

This implies that

$$\sqrt{h^2 + 4e^2} \le h + 2e. \tag{38}$$

Using (36) and (38), we have,

$$g \le \frac{h + (h + 2e)}{2} = h + e. \tag{39}$$

In other words, $g = O(h + e)$.

□
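The two steps of this proof, the quadratic-root bound (36) and its
relaxation to $h + e$ in (39), can be compared numerically. The sketch below
is an illustration only: the helper name `g_upper_bounds` is hypothetical,
the values of $h$, $g$, and $e^2$ are copied from Table 1, and $e$ is taken
as the unrounded square root of $e^2$.

```python
import math

# (author, h, g, e^2) copied from Table 1
TABLE_1 = [
    ("A", 4, 7, 23),
    ("B", 8, 11, 35),
    ("C", 11, 18, 167),
    ("D", 12, 20, 198),
    ("E", 14, 30, 623),
    ("F", 28, 43, 750),
]


def g_upper_bounds(h, e2):
    """Return (root_bound, relaxed_bound) following (36) and (39)."""
    root_bound = (h + math.sqrt(h * h + 4 * e2)) / 2  # right-hand side of (36)
    relaxed_bound = h + math.sqrt(e2)                 # h + e, as in (39)
    return root_bound, relaxed_bound


for author, h, g, e2 in TABLE_1:
    root_bound, relaxed_bound = g_upper_bounds(h, e2)
    print(author, g, round(root_bound, 2), round(relaxed_bound, 2))
```

Because $\sqrt{h^2 + 4e^2} \le h + 2e$, the root bound never exceeds
$h + e$; the relaxation trades some tightness for a simpler expression.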



3.3 The h-Index, g-Index, and Impact Factor

We state the following theorem that relates these parameters.

Theorem 4. The generalized impact factor, g-index, and h-index are related
as per the following inequality.

$$h \ge \left\lfloor \frac{I_f P - g^2}{P - g} \right\rfloor. \tag{40}$$

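Proof. From (5), we have,

$$\sum_{i=1}^{P} c_i = I_f P. \tag{41}$$

Breaking the number of citations in the L.H.S. of (41), we have,

$$\sum_{i=1}^{g} c_i + \sum_{i=g+1}^{P} c_i = I_f P. \tag{42}$$

From (17), we have $\sum_{i=1}^{g} c_i \ge g^2$. However, we need to point
out that at the point $i = g$, $\sum_{i=1}^{g} c_i$ just exceeds $g^2$. The
amount by which $\sum_{i=1}^{g} c_i$ exceeds $g^2$ depends upon the value of
$c_i$ at $i = g$, i.e., $c_g$. Therefore, for all practical purposes, we can
assume $\sum_{i=1}^{g} c_i \approx g^2$. As a result, we have,

$$g^2 + \sum_{i=g+1}^{P} c_i \approx I_f P. \tag{43}$$

Now, we have,

$$c_{g+1} \le h, \quad c_{g+2} \le h, \quad \ldots, \quad c_P \le h. \tag{44}$$

Therefore, we have,

$$\sum_{i=g+1}^{P} c_i \le (P - g)h. \tag{45}$$

Using (43) and (45), we have,

$$g^2 + (P - g)h \ge I_f P. \tag{46}$$

Or,

$$(P - g)h \ge I_f P - g^2. \tag{47}$$

As a result, we get,

$$h \ge \frac{I_f P - g^2}{P - g}. \tag{48}$$

Since $h$ is a whole number, (48) can be written as follows.

$$h \ge \left\lfloor \frac{I_f P - g^2}{P - g} \right\rfloor.$$

□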



In other words, Theorem 4 states another lower bound for the h-index, which
is as follows.

$$h = \Omega\left(\frac{I_f P - g^2}{P - g}\right). \tag{49}$$
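The lower bound of Theorem 4, the floor of $(I_f P - g^2)/(P - g)$, can be
evaluated directly from the values in Table 1. The following sketch is an
illustration under stated assumptions: the helper name `theorem4_bound` is
hypothetical, and $I_f P$ is replaced by the total citation count $C$ from
the last column of Table 1 so that only integer arithmetic is needed.

```python
# (author, h, g, P, C) copied from Table 1; C is the total number of citations
TABLE_1 = [
    ("A", 4, 7, 17, 70),
    ("B", 8, 11, 30, 156),
    ("C", 11, 18, 30, 362),
    ("D", 12, 20, 52, 484),
    ("E", 14, 30, 59, 1009),
    ("F", 28, 43, 181, 2546),
]


def theorem4_bound(g, P, C):
    """Floor of (I_f * P - g^2) / (P - g), using I_f * P = C; requires P > g."""
    return (C - g * g) // (P - g)


bounds = {author: theorem4_bound(g, P, C) for author, h, g, P, C in TABLE_1}
print(bounds)
```

The computed values (2, 1, 3, 2, 3, 5 for authors A through F) coincide with
the fifth column of Table 4, and each is below the corresponding h-index.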



While proving Theorem 4, we come across the following observation. We could
have taken, on the R.H.S. of (44), the value of the g-index rather than the
h-index. We have taken the value of the h-index to make the lower bound on
the h-index tight enough. To understand it better, let us take the value of
the g-index in (44). We now state a lemma that gives a lower bound (though
loose) on the g-index.



Lemma 2. A loose lower bound on the g-index is as follows.

$$g = \omega(I_f). \tag{50}$$

Proof. We could have written (44) as follows.

$$c_{g+1} \le g, \quad c_{g+2} \le g, \quad \ldots, \quad c_P \le g. \tag{51}$$

Therefore, we have,

$$\sum_{i=g+1}^{P} c_i \le (P - g)g. \tag{52}$$

Putting it in (43), we get,

$$g^2 + (P - g)g \ge I_f P. \tag{53}$$

Observe that the h-index has now vanished and does not appear anywhere in
(53). Since the L.H.S. of (53) simplifies to $Pg$, we have,

$$g \ge I_f. \tag{54}$$

In other words, the impact factor can serve as a lower bound for the
g-index.

$$g = \omega(I_f). \tag{55}$$

□



Table 1: Number of citations, indices, and generalized impact factor of
authors. The last column, $C$, is the total number of citations.

Author    $h$    $g$    $e^2$    $e$    $P$    $I_f$    $C$
A         4      7      23       5      17     4.12     70
B         8      11     35       6      30     5.20     156
C         11     18     167      13     30     12.07    362
D         12     20     198      15     52     9.31     484
E         14     30     623      25     59     17.10    1009
F         28     43     750      28     181    14.07    2546



Table 2: Part of citations from paper numbered $(g + 1)$ to $P$ and their
bounds.

Author    $\sum_{i=g+1}^{P} c_i$    $(P-g)h$    $g^2 + \sum_{i=g+1}^{P} c_i$
A         20                        40          69
B         38                        152         159
C         23                        132         345
D         74                        384         474
E         62                        406         962
F         669                       3864        2518



However, this lower bound may not be tight enough; therefore, we call it a
loose lower bound, and in Lemma 2 it is denoted by the symbol small omega,
$\omega$, which is generally used to denote a loose lower bound. The reason
for the looseness of the lower bound is that in (51) we have assumed
$c_i \le g$ for $(g + 1) \le i \le P$, which is a looser assumption than the
one made in (44).

In what follows, we discuss some results to validate the analysis. 



Table 3: Part of citations from paper numbered $h + 1$ to $g$, and from
$h + 1$ to $P$, and their bounds.

Author    $\sum_{i=h+1}^{P} c_i$    $(P-h)h$    $\sum_{i=h+1}^{g} c_i$    $(g-h)h$    $(g-h)g$
A         31                        52          11                        12          21
B         57                        176         19                        24          33
C         64                        209         41                        77          126
D         138                       480         64                        96          160
E         210                       630         148                       224         480
F         1001                      4284        332                       420         645



Table 4: Bounds on the h-index.

Author    h-index    $I_f - e^2/P$    $g - e^2/g$    $(I_f P - g^2)/(P - g)$
A         4          3                3              2
B         8          4                7              1
C         11         5                8              3
D         12         6                10             2
E         14         5                9              3
F         28         9                25             5



Table 5: Bounds on the g-index.

Author    g-index    $h + e^2/h$    $h + e$
A         7          10             9
B         11         13             14
C         18         27             24
D         20         29             27
E         30         59             39
F         43         55             56




Figure 1: The h-index, g-index, e-index, and generalized impact factor of
authors A through F.

Figure 2: The h-index and its lower bounds given by Theorem 1, Theorem 2,
and Theorem 4 for authors A through F.

Figure 3: The g-index and its upper bounds given by Lemma 1 and Theorem 3
for authors A through F.

Figure 4: The number of citations, $\sum_{i=h+1}^{g} c_i$, and its upper
bounds, $(g - h)h$ and $(g - h)g$, for authors A through F. (Horizontal
axis: Authors, numbered 1 to 6; legend: Citations, $(g - h)h$, $(g - h)g$.)

Figure 5: The number of citations, $\sum_{i=h+1}^{P} c_i$, and its upper
bound, $(P - h)h$, for authors A through F.

Figure 6: The number of citations, $\sum_{i=g+1}^{P} c_i$, and its upper
bound, $(P - g)h$, for authors A through F.



4 Results and Discussion 



Table 1 provides different parameters, such as the total number of
citations, h-index, g-index, $e^2$, and generalized impact factor, for
individual authors. We have selected authors from different fields. Note
that Table 2 and Table 3 contain the values of the number of citations in
parts and the values of the corresponding approximations used for these
parts. These values may be useful in understanding the differences between
the actual values of the indices and their corresponding bounds.

Table 4 shows bounds on the h-index as described in the previous section.
The third column of the table represents a lower bound on the h-index. It is
clear that $h \ge \lfloor I_f - e^2/P \rfloor$. However, the lower bound
$h = \Omega(I_f - e^2/P)$ is not very tight. The reason is that while
deriving this bound, we have used the approximation
$\sum_{i=h+1}^{P} c_i \le (P - h)h$, which is not a very tight
approximation. The looseness of the lower bound comes from the fact that we
have assumed $c_i \le h$ for $h + 1 \le i \le P$. Practically, $c_i \ll h$
for $h + 1 \le i \le P$, and that introduces the looseness.
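The size of this looseness can be made explicit. The short derivation below
is an added illustration that follows from (9); the numerical values are
those listed for author A in Table 1 and Table 3.

```latex
h - \left(I_f - \frac{e^2}{P}\right)
  = \frac{hP - \left(I_f P - e^2\right)}{P}
  = \frac{hP - h^2 - \sum_{i=h+1}^{P} c_i}{P}
  = \frac{(P - h)h - \sum_{i=h+1}^{P} c_i}{P},
\qquad
\text{author A: } \frac{52 - 31}{17} = \frac{21}{17} \approx 1.24 .
```

In other words, before rounding, the gap between the h-index and this lower
bound is exactly the slack in the approximation (11) divided by $P$.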

The fourth column of Table 4 represents the lower bound on the h-index given
by $h \ge \lfloor g - e^2/g \rfloor$. Note that this lower bound is tighter
than the lower bound in the third column of the table. For this lower bound,
whatever looseness remains is due to the fact that while deriving it we have
used the approximation $\sum_{i=h+1}^{g} c_i \le (g - h)h$, by assuming that
each $c_i \le h$ for $(h + 1) \le i \le g$. Practically, it may happen that
$c_i \ll h$ for some $i$'s in the range $(h + 1) \le i \le g$. However, we
can say that Theorem 2 gives a sufficiently tight lower bound on the
h-index.

Footnote: For example, author A is from Computer Science and Engineering:
Soft Computing, author B is from Computer Science and Engineering: Networks
and Communication, author C is from Electronics Engineering, author D is
from Mechanical Engineering, and authors E and F are from Biological
Sciences.

Consider now the fifth column of Table 4, which also represents a lower
bound on the h-index. This lower bound is the loosest of the three lower
bounds for the h-index. Its looseness is due to the approximation
$\sum_{i=g+1}^{P} c_i \le (P - g)h$, where we assumed that $c_i \le h$ for
$(g + 1) \le i \le P$. Practically, $c_i \ll h$ for some $i$'s in the range
$(g + 1) \le i \le P$, and this accounts for the looseness of the lower
bound on the h-index.

Table 5 shows the values of the g-index together with the values of the
bounds for the g-index. The third column and the fourth column both
represent upper bounds on the g-index. Specifically, the third column
represents the upper bound on $g$ given by $g = O(h + e^2/h)$, and the
fourth column represents the upper bound $g = O(h + e)$. Note that, for most
authors (A, C, D, and E), the values of the upper bound given in the fourth
column are closer to the actual values of the g-index than the values of the
upper bound given in the third column. This observation suggests that the
upper bound for the g-index given by Theorem 3 is tighter than that given by
Lemma 1. Any looseness in the upper bound given in column three is due to
the fact that in Lemma 1 we have used the approximation
$\sum_{i=h+1}^{g} c_i \le (g - h)g$, assuming that $c_i \le g$ for all $i$
in the range $(h + 1) \le i \le g$. Practically, it may happen that for some
$i$'s, $c_i \ll g$, which accounts for the looseness of the upper bound on
the g-index given by Lemma 1. In Theorem 3, the looseness of the upper bound
is due to the approximation $\sum_{i=h+1}^{g} c_i \le (g - h)h$, assuming
that $c_i \le h$ for each $i$ in the range $(h + 1) \le i \le g$.
Practically, it may happen that for some $i$'s, $c_i \ll h$, which accounts
for the looseness of the upper bound on the g-index given by Theorem 3.
However, since the assumption $c_i \le h$ is tighter than the assumption
$c_i \le g$, the upper bound given by Theorem 3 is tighter than that given
by Lemma 1.
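The two bound columns of Table 5 can be recomputed from Table 1. The sketch
below is an illustration under stated assumptions: the helper names
`lemma1_bound` and `theorem3_bound` are hypothetical, the Lemma 1 value is
rounded up to a whole number (which is what reproduces the third column),
and $e$ is the integer value listed in Table 1.

```python
# (author, h, g, e^2, e) copied from Table 1; e is the integer value listed there
TABLE_1 = [
    ("A", 4, 7, 23, 5),
    ("B", 8, 11, 35, 6),
    ("C", 11, 18, 167, 13),
    ("D", 12, 20, 198, 15),
    ("E", 14, 30, 623, 25),
    ("F", 28, 43, 750, 28),
]


def lemma1_bound(h, e2):
    """Ceiling of h + e^2 / h, computed with integer arithmetic."""
    return (h * h + e2 + h - 1) // h


def theorem3_bound(h, e):
    """Upper bound h + e from Theorem 3."""
    return h + e


rows = [
    (author, g, lemma1_bound(h, e2), theorem3_bound(h, e))
    for author, h, g, e2, e in TABLE_1
]
for row in rows:
    print(*row)
```

Each printed row lists the author, the g-index, and the two upper bounds, in
the same column order as Table 5.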

Figure 1 shows the h-index, g-index, e-index, and generalized impact factor
for authors A through F. There is an observation here, although it is not
our objective in this paper. When the authors are arranged in increasing
order of h-index, we observe that all other ranking parameters are almost in
increasing order as well.

Figure 2 shows the h-index and its lower bounds given by Theorem 1,
Theorem 2, and Theorem 4 for authors A through F. We observe that the lower
bound given by Theorem 2 is the tightest among all three lower bounds
considered in this paper. The reasons for the looseness of these lower
bounds are, as explained earlier, the approximations used in deriving the
expressions for these lower bounds. For example, the looseness of the lower
bound on $h$ given by Theorem 1 is due to the upper bound
$\sum_{i=h+1}^{P} c_i \le (P - h)h$, which is based on the assumption that
$c_i \le h$ for $(h + 1) \le i \le P$. The actual number of citations
$\sum_{i=h+1}^{P} c_i$ and the corresponding upper bound, $(P - h)h$, used
in Theorem 1 are shown in Figure 5. On the other hand, the lower bound on
the h-index given by Theorem 2 uses the approximation
$\sum_{i=h+1}^{g} c_i \le (g - h)h$, which is based on the assumption that
$c_i \le h$ for $(h + 1) \le i \le g$. The actual number of citations
$\sum_{i=h+1}^{g} c_i$ and the corresponding upper bound, $(g - h)h$, used
in Theorem 2 are shown in Figure 4. Further, the lower bound on the h-index
given by Theorem 4 uses the approximation
$\sum_{i=g+1}^{P} c_i \le (P - g)h$, which is based on the assumption that
$c_i \le h$ for $(g + 1) \le i \le P$. The actual number of citations
$\sum_{i=g+1}^{P} c_i$ and the corresponding upper bound, $(P - g)h$, used
in Theorem 4 are shown in Figure 6. As mentioned earlier, the lower bound
given by Theorem 2 can be considered a tight lower bound for all practical
purposes.

Footnote: Authors are numbered in all these figures, i.e., 1 corresponds to
A, 2 corresponds to B, and so on; 6 corresponds to F.

Figure 3 shows the g-index and its upper bounds given by Lemma 1 and
Theorem 3 for authors A through F. We observe that the upper bound given by
Theorem 3 is tighter than the upper bound given by Lemma 1. The reasons for
their looseness are, as discussed earlier, the approximations used in
bounding the summations of the parts of the citations that are involved in
computing the upper bound. For example, the upper bound given by Theorem 3
uses the approximation $\sum_{i=h+1}^{g} c_i \le (g - h)h$, which is based
on the assumption that $c_i \le h$ for $(h + 1) \le i \le g$. The actual
number of citations $\sum_{i=h+1}^{g} c_i$ and the corresponding upper
bound, $(g - h)h$, used in Theorem 3 are shown in Figure 4. On the other
hand, the upper bound on the g-index given by Lemma 1 uses the approximation
$\sum_{i=h+1}^{g} c_i \le (g - h)g$, which is based on the assumption that
$c_i \le g$ for $(h + 1) \le i \le g$. The actual number of citations
$\sum_{i=h+1}^{g} c_i$ and the corresponding upper bound, $(g - h)g$, used
in Lemma 1 are shown in Figure 4. Note that the approximation $(g - h)h$ is
closer to the actual number of citations than the approximation $(g - h)g$;
therefore, the upper bound on the g-index given by Theorem 3 is tighter than
the upper bound given by Lemma 1.

5 Conclusion 

Finding the relationships among indexing parameters for determining the
quality of research is a challenging task. In this paper, we describe some
inequalities relating the h-index, g-index, e-index, and generalized impact
factor. We derive the inequalities from the very basic definitions of these
indexing parameters, without assuming that any of them follows a continuous
model. Also, we do not assume any prior model to be plugged in for relating
these parameters. Further validation of these inequalities forms the future
work.